AITopics | blue box

Collaborating Authors

blue box

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection - Supplemental Material - 1 More Ablation Studies 1.1 Number of test categories in evaluation

Neural Information Processing SystemsFeb-17-2026, 15:20:08 GMT

The distillation can also cover 3D object boxes on the background.

artificial intelligence, category, machine learning, (16 more...)

Neural Information Processing Systems

Industry: Information Technology > Security & Privacy (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Security & Privacy (0.94)
Information Technology > Artificial Intelligence > Vision (0.67)

Add feedback

CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection - Supplemental Material - 1 More Ablation Studies 1.1 Number of test categories in evaluation

Neural Information Processing SystemsOct-9-2025, 10:01:50 GMT

The distillation can also cover 3D object boxes on the background.

artificial intelligence, category, machine learning, (16 more...)

Neural Information Processing Systems

Industry: Information Technology > Security & Privacy (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Security & Privacy (0.94)
Information Technology > Artificial Intelligence > Vision (0.67)

Add feedback

Representations of Fact, Fiction and Forecast in Large Language Models: Epistemics and Attitudes

Li, Meng, Vrazitulis, Michael, Schlangen, David

arXiv.org Artificial IntelligenceJun-3-2025

Rational speakers are supposed to know what they know and what they do not know, and to generate expressions matching the strength of evidence. In contrast, it is still a challenge for current large language models to generate corresponding utterances based on the assessment of facts and confidence in an uncertain real-world environment. While it has recently become popular to estimate and calibrate confidence of LLMs with verbalized uncertainty, what is lacking is a careful examination of the linguistic knowledge of uncertainty encoded in the latent space of LLMs. In this paper, we draw on typological frameworks of epistemic expressions to evaluate LLMs' knowledge of epistemic modality, using controlled stories. Our experiments show that the performance of LLMs in generating epistemic expressions is limited and not robust, and hence the expressions of uncertainty generated by LLMs are not always reliable. To build uncertainty-aware LLMs, it is necessary to enrich semantic knowledge of epistemic modality in LLMs.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2506.01512

Country:

North America > United States (0.46)
Europe > Austria (0.28)

Genre:

Research Report > Experimental Study (0.70)
Research Report > New Finding (0.48)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.31)

Add feedback

From Goal-Conditioned to Language-Conditioned Agents via Vision-Language Models

Cachet, Theo, Dance, Christopher R., Sigaud, Olivier

arXiv.org Artificial IntelligenceNov-26-2024

Vision-language models (VLMs) have tremendous potential for grounding language, and thus enabling language-conditioned agents (LCAs) to perform diverse tasks specified with text. This has motivated the study of LCAs based on reinforcement learning (RL) with rewards given by rendering images of an environment and evaluating those images with VLMs. If single-task RL is employed, such approaches are limited by the cost and time required to train a policy for each new task. Multi-task RL (MTRL) is a natural alternative, but requires a carefully designed corpus of training tasks and does not always generalize reliably to new tasks. Therefore, this paper introduces a novel decomposition of the problem of building an LCA: first find an environment configuration that has a high VLM score for text describing a task; then use a (pretrained) goal-conditioned policy to reach that configuration. We also explore several enhancements to the speed and quality of VLM-based LCAs, notably, the use of distilled models, and the evaluation of configurations from multiple viewpoints to resolve the ambiguities inherent in a single 2D view. We demonstrate our approach on the Humanoid environment, showing that it results in LCAs that outperform MTRL baselines in zero-shot generalization, without requiring any textual task descriptions or other forms of environment-specific annotation during training. Videos and an interactive demo can be found at https://europe.naverlabs.com/text2control

configuration, goal-conditioned, language-conditioned agent, (14 more...)

arXiv.org Artificial Intelligence

2409.16024

Country:

Europe > Austria > Vienna (0.14)
South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.04)
North America > United States > Massachusetts > Middlesex County > Reading (0.04)

Genre: Research Report (0.63)

Industry: Leisure & Entertainment > Sports (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

Text2Motion: From Natural Language Instructions to Feasible Plans

Lin, Kevin, Agia, Christopher, Migimatsu, Toki, Pavone, Marco, Bohg, Jeannette

arXiv.org Artificial IntelligenceNov-26-2023

We propose Text2Motion, a language-based planning framework enabling robots to solve sequential manipulation tasks that require long-horizon reasoning. Given a natural language instruction, our framework constructs both a task- and motion-level plan that is verified to reach inferred symbolic goals. Text2Motion uses feasibility heuristics encoded in Q-functions of a library of skills to guide task planning with Large Language Models. Whereas previous language-based planners only consider the feasibility of individual skills, Text2Motion actively resolves geometric dependencies spanning skill sequences by performing geometric feasibility planning during its search. We evaluate our method on a suite of problems that require long-horizon reasoning, interpretation of abstract goals, and handling of partial affordance perception. Our experiments show that Text2Motion can solve these challenging problems with a success rate of 82%, while prior state-of-the-art language-based planning methods only achieve 13%. Text2Motion thus provides promising generalization characteristics to semantically diverse sequential manipulation tasks with geometric dependencies between skills.

blue box, sequence, text2motion, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/s10514-023-10131-7

2303.12153

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > Michigan (0.04)
North America > United States > Massachusetts (0.04)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.92)
(2 more...)

Add feedback

Learning Action Duration and Synergy in Task Planning for Human-Robot Collaboration

Sandrini, Samuele, Faroni, Marco, Pedrocchi, Nicola

arXiv.org Artificial IntelligenceOct-20-2022

A good estimation of the actions' cost is key in task planning for human-robot collaboration. The duration of an action depends on agents' capabilities and the correlation between actions performed simultaneously by the human and the robot. This paper proposes an approach to learning actions' costs and coupling between actions executed concurrently by humans and robots. We leverage the information from past executions to learn the average duration of each action and a synergy coefficient representing the effect of an action performed by the human on the duration of the action performed by the robot (and vice versa). We implement the proposed method in a simulated scenario where both agents can access the same area simultaneously. Safety measures require the robot to slow down when the human is close, denoting a bad synergy of tasks operating in the same area. We show that our approach can learn such bad couplings so that a task planner can leverage this information to find better plans.

artificial intelligence, information, robot, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/ETFA52439.2022.9921721

2210.1166

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > Italy > Campania > Naples (0.04)
Asia > Japan > Honshū > Kansai > Hyogo Prefecture > Kobe (0.04)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (1.00)

Add feedback

Getting Connected with Google Home Using API.AI & Talend

#artificialintelligenceJul-7-2017, 23:36:02 GMT

"OK Google, what can you do when connected to Talend?" In this tutorial, I will show how to create an Agent in API.AI that will respond to commands spoken to Google Home. The Agent will reverse the words in a sentence spoken to Google Home by making use of a Talend web service which is used to carry out the word reversal. A very simple example, but it demonstrates the ground work you will need to create some really quite interesting applications. You do not need one to try this tutorial out as Google has provided an emulator, but I can highly recommend the device. Recently Google opened up access to the Actions on Google API. You can either use the Actions SDK or use API.AI. API.AI was recently acquired by Google. While API.AI is really quite simple to use, it is quite limited in how it can be used with Google Home at the moment.

artificial intelligence, chatbot, natural language, (18 more...)

#artificialintelligence

Country:

Europe (0.05)
Oceania > Australia (0.04)
North America (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)

Add feedback

Introducing CS to Newcomers, and JES As a Teaching Tool

Communications of the ACMOct-29-2016, 04:36:34 GMT

I had an interesting experience recently. I agreed to run a session on computer science for the STEP (Science and Technology Entry Program) students at Union College's Kenney Community Center. The range of students was large, from 7th to 12th grade. Usually in a session like this I start by asking two things. First, in what ways are computers already in their lives?

artificial intelligence, computer, python, (15 more...)

Communications of the ACM

Country:

North America > United States > New York > Schenectady County > Schenectady (0.05)
North America > United States > Georgia > Fulton County > Atlanta (0.05)

Industry: Education > Educational Setting > Higher Education (0.35)

Technology: Information Technology > Artificial Intelligence (0.71)

Add feedback

Practice Programming Through Play

#artificialintelligenceSep-6-2016, 01:50:13 GMT

Solving puzzles through your own powers of thought gives a certain kind of satisfaction that is especially rewarding. Games like Sudoku, Tetris, and Rubik's Cube are great for strengthening mathematical thinking and visual-spacial intelligence. Nowadays we seem to have an endless supply of puzzle games on mobile devices to keep our minds occupied during all of the spare moments of the day. It's fine to use puzzle games to fill up the empty spaces of time, but I've found some games that entice me to go much deeper. Lately I've been getting into games geared towards introducing kids to programming concepts.

artificial intelligence, instruction, prog2, (18 more...)

#artificialintelligence

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Communications > Mobile (0.54)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.34)

Add feedback

Filters

Collaborating Authors

blue box

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection - Supplemental Material - 1 More Ablation Studies 1.1 Number of test categories in evaluation

CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection - Supplemental Material - 1 More Ablation Studies 1.1 Number of test categories in evaluation

c12dd3034259fc000d80db823041c187-Supplemental-Datasets_and_Benchmarks.pdf

Representations of Fact, Fiction and Forecast in Large Language Models: Epistemics and Attitudes

From Goal-Conditioned to Language-Conditioned Agents via Vision-Language Models

Text2Motion: From Natural Language Instructions to Feasible Plans

Learning Action Duration and Synergy in Task Planning for Human-Robot Collaboration

Getting Connected with Google Home Using API.AI & Talend

Introducing CS to Newcomers, and JES As a Teaching Tool

Practice Programming Through Play